Entropic inequalities as a necessary and sufficient condition to noncontextuality and 

locality 
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The assumption of local realism in a Bell locality scenario imposes non-trivial conditions on the 
Shannon entropies of the associated probability distributions, expressed by linear entropic Bell in- 
equalities. In principle, these entropic inequalities provide necessary but not sufficient criteria for 
the existence of a local hidden variable model reproducing the correlations, as, for example, the 
paradigmatic nonlocal PR-box is entropically not different from a classically correlated box. In this 
paper we show that for the n-cycle scenario, entropic inequalities completely characterize the set 
of local correlations. In particular, every nonsignalling box which violates the CHSH inequality - 
including the PR-box - can be locally modified so that it also violates the entropic version of CHSH 
inequality. As we show, any nonlocal probabilistic model when appropriately mixed with a local 
model, violates an entropic inequality, thus evidencing a very peculiar kind of nonlocality. As the 
n-cycle captures equally well both the notion of local realism introduced by Bell and that of noncon- 
textuality presented by the Kochen-Specker theorem, the results are also valid for noncontextuality 
scenarios. 

PACS numbers: 03.65.Ud, 03.67.Mn 



I. INTRODUCTION 

The quantum nonlocal correlations that may arise in 
experiments performed by spacelike separated and inde- 
pendent observers, are a key concept in the foundational 
aspects of quantum mechanics. The expected classical in- 
tuition that physical quantities have well-established val- 
ues previous to any measurement and that signals can- 
not propagate instantaneously, do not suffice to repro- 
duce the quantum mechanical predictions [l|, highlight- 
ing a very counterintuitive aspect of quantum theory that 
has received strong experimental corroboration over the 
years 0]. From an applied point of view, nonlocality is 
now recognized as a novel physical resource, which en- 
ables protocols such as device-independent quantum key 
distribution 0], random number generation Q and the 
reduction of communication complexity in distributcd- 
computing scenarios || . 

In practice, nonlocality is witnessed through the viola- 
tion of a Bell inequality [l[ . Given a certain experimen- 
tal scenario defined by the number of spatially separated 
parties, the possible different measurement settings for 
each party, and the possible outcomes for each setting, 
local-realistic joint probability distributions form a con- 
vex set to which Bell inequalities, a set of linear inequal- 
ities of the probabilities, are the non-trivial facets @. 
This geometric approach provides a general framework in 
which Bell inequalities can be derived, since the task to 
find the facets of a convex set is a linear program that can 
be solved efficiently. The problem is that, generally, the 
size of the linear program grows very fast as the nonlocal- 
ity scenario becomes less simple, some classes even being 
known to be a NP-complete problem Q . In spite of that, 
some particular characterizations are well known. The bi- 
partite scenario with two dichotomic measurements per 
party is completely characterized by the Clauser-Hornc- 
Shimony-Holt (CHSH) inequality [7j and a generalization 



of CHSH to more outcomes is provided by the Collins- 
Gisin-Linden-Massar-Popescu (CGLMP) inequality 
the CHSH and CGLMP inequalities fully describing the 
set of local correlations up to 3 outcomes @. However, 
for a number of outcomes larger than 3, a complete char- 
acterization of the inequalities bounding the set of local 
correlations is still to be found 0, [l(| , highlighting the 
difficulty and limitations of this approach. 

In a conceptually different approach introduced by 
Braunstein and Caves [ll|, it was shown that local re- 
alism imposes non-trivial conditions already on the level 
of the Shannon entropies. The Shannon entropies carried 
by the measurements on two distant systems must satisfy 
certain inequalities, which can be regarded as entropic 
Bell inequalities. It was recently pointed out that the 
characterization of the local correlations on the entropic 
level, also defines a linear programming problem fl2l[l3l]. 
One advantage of this entropic approach is that it can 
readily be applied to quantum systems of arbitrary local 
dimension and general measurement operators, since the 
inequalities do not depend on the number of outcomes 
of the measured observables. That is, while the dimen- 
sion of the set of local correlations in terms of probabil- 
ities grows exponentially with the number of outcomes 
for each observable, the entropic set of local correlations 
is independent of that. Another advantage of the en- 
tropic approach is that it easily adapts to situations of 
additional independence requirements, like the bilocality 
scenarios introduced by a entanglement swapping exper- 
iment 3 ^1 and general correlation and causal model 
scenarios [16l4l8l | . The independence constraints are non- 
linear on the level of probabilities, defining a nonconvex 
set, while in terms of entropies such constraints are linear 
and still define a convex set that can be solved by linear 
programming. In spite of their attractive properties, en- 
tropic Bell inequalities are, in principle, sufficient but 
not necessary conditions to witness nonlocality. That is, 



there are nonlocal distributions violating a Bell inequal- 
ity, that, however, do not violate its entropic counter- 
part [l3|. However, as we show in this paper, the situa- 
tion is more involved than initially thought, as entropic 
inequalities can, at least in some scenarios, completely 
characterize the set of local correlations. 

In this paper we show that in the n-cycle scenario, 
any nonlocal distribution when augmented with shared 
randomness will also violate a entropic Bell inequality. 
The n-cycle can be seen as generalization of the CHSH 
scenario for an arbitrary number of obscrvablcs for each 
party, with only a subset of pairwise observables being 
jointly measurable (Fig.Q}. The n-cycle captures equally 
well both the notion of local realism introduced by Bell [l[ 
and that of noncontextuality presented by the Kochen- 
Specker (KS) theorem [Tj|. A complete characterization 
of the n-cycle in terms of an exponential number of tight, 
linear inequalities has recently been found (2p| . As we 
show here, the n-cycle can be equivalcntly described by 
a polynomial number of entropic inequalities and a list 
of local/noncontextual points lying in the facets of the 
corresponding set of correlations. 
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FIG. 1. (Color online) Graphical representation of the n-cycle. 
The vertices represent different observables and the edges con- 
nect observables that are jointly measurable, a. CHSH sce- 
nario, 2 parties with 2 measurement settings each. Labelling 
Xi = Aq, X[i = A\, X2 = Bq, X4 — Bx, we recover the usual 
picture where two parties, Alice and Bob, perform space-like 
separated measurements. Alice measures one out of two pos- 
sible measurement settings Aq or Ax, and similarly for Bob. 
b. KCBS scenario, 5 observables arranged in a cyclic configu- 
ration such that each observable is compatible with its neigh- 
bors c. Generalization of the CHSH/KCBS scenario, with n 
observables in a cyclic configuration (the n-cycle [22|]). For 
dichotomic observables, the set of local/noncontextual cor- 
relations is completely characterized by Q. For a general 
number of outcomes, the unique nontrivial entropic inequali- 
ties are those given by ([5]). 



II. THE N-CYCLE SCENARIO 

The n-cycle scenario is defined for any number n > 3 
of observables Aq, . . . , X n , imposing the restriction that 
only Xi and A^ + i are pairwise jointly measurable for all 
i = l,...,n (with X n+ k = A^). Any two observables 
Xi and Xi + \ are jointly measurable, or compatible, if 
the result for the measurement of Xi, even if not per- 
formed, does not depend on the prior or simultaneous 
measurement of Xi + \ and vice versa. This is the notion 
of noncontextuality captured by the KS theorem [l{J. 
It can readily be turned into the notion of Bell's local- 
ity [![, where the compatibility and noncontextuality of 
the observables is assured by spacelike separation of lo- 
cal measurements of different particles. In particular, for 
n = 4, the n-cycle corresponds to the CHSH Bell sce- 
nario [3 (Fig. [TJl) , while for n — 5 it is the noncontex- 
tuality scenario considered by Klyachko-Can-Binicioglu- 
Shumovsky (KCBS) HJ (Fig.[Tb). For general n, it can 
be visualized as an n-sided polygon (Fig. [lb) . 

Any correlation that can be reproduced by means of 
a noncontextual/local hidden variable model is a convex 
sum of the deterministic probability distributions, that 



p(x l x i+ i\X i X l+ i) = g(X)p(x l \X i , X)p(x i+1 \X i+1 , A), 

A 

(1) 

where Xi stands for the outcome Xi = 0, • • • , d — 1 of the 
corresponding observable Xi. The sum is performed over 
all the d n deterministic distributions parameterized by A 
(with a distribution g(X)), that is, all the distributions 
that given a certain observable Xi yield with probabil- 
ity 1 a certain outcome Xi. The noncontcxtual/local de- 



terministic points define a convex set, to which the non- 
contextuality/Bell inequalities are the non-trivial bound- 
aries. 

Similarly, the set of allowed nondisturb- 
ing/nonsignalling distributions also define a convex-set. 
Nondisturbing/nonsignalling distributions are defined as 
the ones for which 



p(xi\Xi) = p{x l Xi + i\X i X i+ i) 



Xi+l 



p{x l x i _i\x l Xi^i), 



(2) 



that is, the outcome of Xi cannot be affected by which 
compatible observable (A^_i or Xi + \) it is jointly mea- 
sured with. For dichotomic observables X4 = 0, 1 the 
vertices of the nondisturbing/nonsignalling set are given 
by M 



Pmax (XiXi+l\XiX l+1 ) 



1/2 , Xi © X l+ i = £-l, 7i 

, otherwise 



(3) 

where stands for addition modulo 2,7 = {71, . . . , 7„}, 
7, = { — 1,1} and the total number of 7.; = —1 is 
odd. Note that for n = 4 and 7 = {1,1,-1,1} this 
represents the maximally nonlocal distribution allowed 
by nonsignalling, the so called PR-box [23| . Given a 
certain nondisturbing/nonsignalling probability distribu- 
tion, any other distribution than can be achieved from it 
using local reversible transformations is said to be equiv- 
alent. Local reversible operations consist of operations 
of two types: relabelling of the observables that preserve 
the mutual compatibility of the joint observables XiXi + i, 
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eg, Xi — > X i+2 , or relabelling of the outputs (possibly 
conditioned on the observable), e.g, Xi — > Xi © 6^1. 

Recently, the full characterization of the n-cycle has 
been found for dichotomic observables [2(|. There are 
2™~ 1 tight equivalent inequalities given by 



CZ = ^2'y i (X i X i+1 )<{n-2) 



(4) 



where {X t X 3 ) = p^XiX^+p^XiX^-p^XiXj)- 
p(\0\XiXj) stands for the expectation value of the ob- 
servable XiXj and again the total number of 7, = —1 
is odd. A noncontextual/Bcll inequality is said to be 
equivalent to another one, if it can be obtained from it by 
local reversible operations and/or permutation of observ- 
ables. Note that p^^ maximally violates C^, achieving 
C^ipZiax) = n J while not violating any other equivalent 
inequality. In the following we will generally refer to C n 
and Pmax as the ones with 7^ = 1 for i = 1 , • • • , n — 1 but 
7n = -1- 

Similarly, the complete cntropic characterization of the 
noncontextual/local set of probability distributions has 
been found for the n-cycle [12l . [l3[ (see also [24| for the 
?i = 5 case). A probability distribution in this scenario 
is entropically noncontextual if and only if the set of n 
equivalent Braunstein-Caves (BC) entropic inequalities 

BC* = H (X k X k+1 )+ ]T HiX^-J^HiXjXj+jKO 

jjt k, k+i j^k 

(5) 

hold for all k = l,...,n where H(XiXj) = 
J2 Xi , Xj -p( x i x j\XiXj)\og 2 p(xiXj\XiXj) is the Shannon 
entropy of the probability distribution associated with 
the measurements Xi and Xj. This set of cntropic 
inequalities is said to be maximal in the sense that 
no other entropic inequality can detect the contextual- 
ity/nonlocality not detectable by it, so that this set of 
tight entropic inequalities completely characterizes the 
region of noncontcxtual/local probabilistic models in en- 
tropy space. 

In order to transform an entropic inequality into an 
equivalent one, the only symmetry operations that can 
be applied are permutations of the observables. It is a 
basic feature of the Shannon entropy its invariance un- 
der permutations of the sample space. Violations of BC 
inequalities witness then a very peculiar kind of contextu- 
ality/nonlocality. If a probabilistic distribution violates 
the BC inequalities, then so does any other distribution 
obtained by the permutation of the outcome probabil- 
ities, provided that the permutation leads to the same 
marginal distributions. This leads to the following phe- 
nomenon: From the point of view of the entropic inequal- 
ity BC*, the maximally contextual/nonlocal distribution 
Pmax i s n °t different from a classically correlated and non- 
contextual/local distribution, 



Pc ( x i 



x,x 



1/2 , Xi © x i+ i = <5_ 
, otherwise 



where 7- = ji for i ^ k but Y k = — 7^. Note that has 
an even total number of 7^ = — 1 and does not violate any 
inequality. Thus, since p max is entropically equivalent 
to a noncontextual/local distribution, it does not violate 
any entropic inequality. In this sense, an entropic non- 
contextual/Bcll inequality is a necessary but not suffi- 
cient criterion to probe the noncontextual/local behav- 
ior a distribution. However, as we show next, entropic 
inequalities can be turned into a necessary and sufficient 
condition, since any contextual/nonlocal distribution vio- 
lating CJ also violates the entropic BC inequalities when 
properly mixed with a noncontextual/local distribution. 



III. ENTROPIC INEQUALITIES COMPLETELY 
CHARACTERIZE THE N-CYCLE SCENARIO 
WITH DICHOTOMIC OUTCOMES 

First, note that the maximum violation of the BC in- 
equality for dichotomic observables is given by BC* = 
1, since H(Xj) < H(XjX j+1 ) and H(X k X k+1 ) < 
H(X k ) + H(X k+1 ) < H{X k+n ^X k ) + H(X k+1 ) < 
H(X k+n _iX k ) + 1. The maximal violation of BC 1 ^ = 1 
can be achieved by the probability distribution 



PEmax — 2 ^max + Pc 



(7) 



for all 7 such that 7^ 



T, since H(Xj) 



HiXjXj+x) = 1 for all j ^ k and H{X k X k+l ) 
That is, a convex combination of two nonviolating distri- 
butions may violate an entropic inequality, highlighting 
the strongly non- linear character of it. Note that p^ can 
be achieved with 1 bit of shared randomness, that is, 
Xi = Xi + i = or Xi = Xi + \ = 1 with the same proba- 
bility 1/2 for all i = 1, . . . , n. Similarly the convex com- 
bination in ([7]) also requires 1 bit of shared randomness, 

outputting p m ax or Pc ' both with probability 1/2. When 
augmented with some shared randomness, it is possible 
to turn an entropically noncontextual/local distribution 
into a contextual/nonlocal one. In a noncontextual/Bcll 
scenario, shared randomness is always an available and 
valid resource. However, for usual noncontextual/Bcll in- 
equalities, the mixing with a local point cannot improve 
the violation of the inequality due to its linearity. 

As we show next, the mixing with pj^ is sufficient 
to entropically detect any contextual/nonlocal distribu- 
tion. First we note that in the CHSH scenario (n = 4 
in the n-cycle), it is known that any probability dis- 
tribution can be transformed into an isotropic distri- 
bution through a local dep olarization process, keeping 
the C4 value invariant [2a]. The isotropic distribution 
has the property of being invariant under the inter- 
change of the inputs or outputs and being locally un- 
biased, that is, pi(x l x i+ i\X l X l+1 ) = pi{xi +1 Xi\XiX i+1 ) 
and p\(xi\Xi) = 1/2. For the n-cycle, the isotropic dis- 
tribution p\ corresponds to a probabilistic mixture, 



(0) 



Pi = epmax + (1 - e)Pw, 



(8) 
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with p w (xiXi + i\XiX i+ i) = 1/4 being pure white noise. 
The corresponding C n value is given by C n {pi) = ne, that 
is, pi is contextual/nonlocal for e > (n — 2)/n. From the 
entropic point of view p\ is equivalent to the distribution 
(Pc + (1 — e)p w and thus no direct violation of entropic 
BC inequalities is possible. 

The first step in our proof is to show that any distribu- 
tion in the n-cycle scenario can be turned into a isotropic 
one without changing the values of C n , that is, a gener- 
alization of the depolarization protocol devised in Rcf. 

(see also (2(|) for the n = 4 case. Then, for our 
purposes and to simplify the presentation it is enough to 
consider isotropic boxes only (however as shown in the 
Appendix this is not strictly necessary). The depolariza- 
tion procedure is done in two steps: 

i) p{xiXj \XiXj) is made locally unbiased by flipping 
both outputs simultaneously with probability 1/2, that 
is, Xi — > Xi © 1 and Xj — > Xj © 1 

Xj®nk) where x\ 



u)p(xiXj\XiXj) -» y^p( 



X ^ X j I J£ 



means flipping the output if i 6 { 1 , . . . , n — k + 1 } and 
ffi„ stands for addition modulo n. After the second step 
the initial distribution is in the isotropic form, however 
maintaining the value of C n unchanged. 

Computing the BC value for the isotropic distribution 
mixed with the classically correlated box, that is, vp\ + 
(1 — v)pc, expanding around v — we find that 



BC„ 



_[/(n,e)-(2-n(l-e))ln U ] 



(9) 



with/(n,e) = 2-n(l-e)(l+ln 2) + (n+e-ne) In (1 - e)- 
eln(l + e) + ln(4/(l - e 2 )). For any 2-n(l-e) > tak- 
ing a sufficiently small v ensures BC to be positive since 
f(n, e) does not depend on v. That is, for any nonlocal 
isotropic distribution e > (n — 2)/n the BC inequality 
can be violated. This is the same bound obtained by the 
direct calculation of C n (pi). Given that any distribution 
can be turned into the isotropic box without changing its 
C n value, this means that any contextual/nonlocal distri- 
bution in the n-cycle scenario also violates the entropic 
BC n inequality when properly mixed with a classically 
correlated and noncontextual/local distribution. 

We note that we have analyzed the specific case of 
a distribution violating a specific equivalence of C% 
where all 74 = 1 but 7„ = —1. However, the gen- 
eralization for distributions violating other equivalences 
of is straightforward. Since there is a one-to-one 
correspondence between the vertices of the nondisturb- 
ing/nonsignalling set and each of the facets of the non- 
contextual/local set, if a distribution violates C^, it can 
be brought to the isotropic form pj = p1 iax + p w again 
without changing the value of C%. Mixing pj with the 

corresponding classical correlation will lead to viola- 
tions of BC^ and therefore to the same conclusions as 
before. It is important to note that the characterization 
of the noncontextual/local set is achieved with n entropic 
inequalities, as opposed to the 2" -1 linear inequalities 



(HJ), however at the cost of introducing a list of classical 
points Pq . 



IV. CONCLUSION 

In principle, entropic inequalities only provide a nec- 
essary but not sufficient criterion for noncontextuality 
and local realism. However, we have shown that for the 
n-cycle with dichotomic outcomes, entropic inequalities 
turn also to be sufficient, since any contextual/nonlocal 
probabilistic model will display entropic violations if 
properly mixed with a classical model. It is quite surpris- 
ing that a polynomial number of non-linear and non-tight 
inequalities may completely characterize the set of non- 
contextual/local correlations, that otherwise would re- 
quire an exponential number of linear and tight inequal- 
ities to do so. 

One obvious question is how this result would extend 
for more complex scenarios, involving more than two- 
outcomes and possibly more parties as in a multipartite 
Bell test. Even in the bipartite case, n = 4 for the n- 
cycle, the complete characterization of the local correla- 
tions is not known for a number of outcomes larger than 3 
Could it be that the BC entropic inequality aug- 
mented with shared randomness fully characterizes the 
n-cycle for a general number of outputs? Another inter- 
esting question is to understand the role of entropic in- 
equalities in the bilocality scenario, where independence 
constraints define a nonconvcx set, difficult to character- 



ize in the probability space [15 1. 

Finally, violations of linear Bell inequalities can be un- 
derstood as a resource, for instance, allowing for higher 
probability of success in some information tasks Q. Is 
there any operational interpretation for the violation of 
an entropic inequality in terms of a relevant physical 
task? If that turns out to be the case, an interesting sce- 
nario would arise, where nonlocal but entropically clas- 
sical correlations could be turned into a useful resource, 
being activated by the use of shared randomness. 



Appendix: Violation of the entropic inequalities 
without the depolarization procedure 

We show here that given a general nonlocal probability 
distribution in the CHSH scenario, the mixing with the 
classical correlation is sufficient to violate the entropic 
inequality, without the need of the depolarization proce- 
dure. 

The local set consists of 16 extremal points P^' 7 ' 5 



parameterized as 



Pdet (ab\xy) 



a = ax © (3, 

b = 72/ © 6 
otherwise 



(A.l) 



and all the 8 nonlocal extremal points Pp^' 7 of the 
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nonsignalling set can be parameterized as 



/ ,i \ f 1/2 , a © b = xy + ax + By + 7 
p PR (ab\xy) = { Q ' j0thcrwise 

. (A.2) 

here a, /?, 7, (5 £ {0,1}. To simplify the description we 
have employed the common notation to the CHSH sce- 
nario, that is, X\ and X3 corresponding to x = and 
x = 1 and X2 and X4 corresponding to y = and y = 1, 
while a and 6 label the corresponding outcomes. For a 
general distribution written as a convex combination of 
all extreme points 



^QpR° ~ ^QpR 1 > 1- Mixing the p (<z&|xy) with the classi- 
cal correlated box as before and expanding around v = 0, 
one finds that BC 4 = (w/ln4) [5 + 21nw(l - C 4 )], that 
violates the entropic inequality for C4 > 1 (g is a func- 
tion of p (a6|xy) but independent of v). 

The same argument can be applied to the general n- 
cycle, and we have tested up to n = 7 that the same result 
holds. For larger n, the procedure becomes unfeasible 
given that the number of extremal points increases expo- 
nentially, but we conjecture that the same result should 
hold for any n. 
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a,/3,7,<5 



(A.3) 

the condition for the violation of the CHSH inequal- 
ity is that d = (-l^-^+^^+^f^ + 

a,/3,7,<5 
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